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Abstract 

In this paper we investigate the kernel estimator of the density for a stationary reversible 
Markov chain. The proofs are based on a new central limit theorem for a triangular array of 
reversible Markov chains obtained under conditions imposed to covariances, which has interest 
in itself. 

1 Introduction and main results 

Eor estimating the marginal density for dependent sequences the dependence structure plays 
an important role. One possible estimator is the kernel estimator introduced by Rosenblatt 
(1956a). In general the dependence is imposed in terms of mixing conditions (Bradley, 1993, 
Bosq et al, 1999 among many others), in terms of coupling coefficients for functions of i.i.d. (Wu 
et al, 2010) or positive association of random variables (Lin, 2003). 

In this paper we study the kernel estimator for reversible Markov chains. It is well known that 
for strictly stationary reversible Markov chains the covariances can be viewed as a measure of 
dependence (see Kipnis and Varadhan, 1986). When estimating the density via kernel estimators 
we introduce a triangular array of random variables which is only row-wise stationary. This 
makes it difficult for studying the kernel density of the marginal distribution for reversible 
Markov chains without imposing recurrence conditions. As a matter of fact results on the kernel 
estimators for marginal density of reversible Markov chains are very rare. We noticed only 
the paper by Lei (2006) dealing with large deviations results for the integrated error of the 
kernel density estimators for reversible Markov chains. The class they considered is of reversible 
irreducible Markov chains with the transitions satisfying a uniform integrability condition in 
square mean. However their result cannot be applied when studying the density at a point or 
several points. In this paper we develop tools that make this study possible. 

Let {Xn)nez be a stationary reversible Markov chain with marginal distribution vr(A) = 
P{Xn G A), for all Borel sets A. For a stationary Markov chain the reversibility means that the 
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distribution of (Xo,Xi) is the same as of (Xi,Xo). Assume that vr has a marginal density /(x), 
continuous at x. We shall consider in this paper the Rosenblatt (1956a) estimator of density 
defined by 


fn{x) 


1 

nbn 


n 




X- Xk. 


( 1 ) 


where bn is a bandwidth converging to 0 and AT is a kernel, a known density function. 

The problem considered in this paper is the consistency and the speed of convergence of the 
kernel density estimator of the density at several points {xj)i<j<m which will be given via a 
multivariate CLT. 

To treat the problem we shall use the following notation 


Hkiu, v) = P{Xq > u,Xk > v) - P{Xo > u)P{Xk > v) (2) 


and we shall denote 


Vk = 


The condition we shall impose to is 


Hk{u, v)\dudv. 


- kn{ky 


( 3 ) 

( 4 ) 


where l{x) is a function increasing to infinity such that for any positive k, limx^oolikx)/l{x) = 1 
(slowly varying at infinite). 

The following condition is imposed to the joint density of the vector (Xq, A 2 ) and a family 
of points of interest ixj)i<j<m- there exists the joint density / 2 (x, y) of (Xq, A 2 ) which is locally 
bounded around any pair {xi,Xj)i<ij<m in the sense that there exists a constant M and a 
constant Cm (both depending on {xi,Xj)) such that 


sup \f2{xi +a, Xj+ a)\ < Cm- (5) 

\a\<M 


This condition is weaker than the condition which is usually imposed in the dependent cases 
which requires that all the densities of vectors (Xq , Xj ) are uniformly bounded on (see 
condition in Bosq, 1998, or in Bosq et al, 1999). Local conditions can be found for instance in 
papers by Liebscher (1999) and Dedecker and Merlevede (2002). 

All along the paper, we assume that the kernel K satishes the Condition C below: 

(Cl) K is symmetric decreasing on (0,oo) and f K{u)du = 1. 

(C2) x^K{x) —0 as X —>• 00 . 

(C3) K is differentiable with K'{x) bounded. 

Note that a normal kernel will satisfy all these conditions. The convergence in distribution 

p 

will be denoted by =>, and —)• denotes the convergence in probability. 

The main result of this paper is the following: 


Theorem 1 Let {Xj)j^i be a stationary reversible Markov chain with marginal density function 
/(x) satisfying condition Assume that the bandwidth bn in the estimator m satisfies nbf —)• 
00 and the kernel K satisfies Condition C. Then, at any points xi,--- ,Xm where /(x) is 
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continuous, different of 0 and the joint densities satisfy condition we have 


fnjXj) - EfnjXj) 

Sfn{xj) f K^{u)duf/^ 

where Im is the identity matrix. 


, 1 < j < m ^ iV(0,/m), 


It is well known that if the density is twice continuously differentiable at Xj then the bias is 
of order (see Hardle 1991, relation (2.3.2)) 


^{in{Xj)) - f{xj) = ^f'fxj) + o{bl) as bn 0. 
By combining this result with Theorem [T] we get the following corollary: 


Corollary 2 In addition to the conditions of Theorem [IJ assume that f is twice continuously 
differentiable at {xj)i<j<m o,nd nb^ —>■ 0. Then 



fnjxj) - f{Xj) 
ifn{xj) f K‘^{u)duy/^ 


1 < j < m 




Let us comment about the dependence coefficient used in our results defined in Q. 

If we have positive dependence, in the sense that Hk{x,y) > 0 for all {x,y) G M^, then by 
the Lehmann Lemma (see Newman, 1980) we have 


m = cov(Xo,Xk). 


This coefficient can also be controlled by a pairwise mixing condition which is weaker than 
strong mixing coefficient introduced by Rosenblatt (1956b). As in Rio (2000) relation (1.8a), 
define 

Ok = ak(Xo,Xk) = 2 sup IIIk(x,y)l 
(x,y)GR^ 

Theorem 1.1 in Rio (2000) states an estimate of the covariance between Xq and in terms of 
Qffc. However, in the proof, the author actually estimated r]^. Therefore we have 




Vk S 


< 2 



u)du, 


where Q\Xq\ is the quantile function of |Ao|, i.e. the generalized inverse of the function P(|Ao| > 
t). 

In particular, for 5 > 0 

n ( 2 +< 5 ) 11 ^ 112 

Vk<2af^ IIA0II2+5; 

and if ||Ao||oo < 1 then rjk < 2ak- 

Recall that, in the stationary setting, the Rosenblatt pairwise strong mixing coefficient, for 
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an integer A; > 0, is defined by 

ctfc = Ofc(Xo, Xk) = sup |IP(Xo G A, Xk G B) — P(Xo G A)(Xk G -6)1, 

A,B€l3 

where B denotes the Borel sigma algebra on the line. Clearly Ok < 2ak- In terms of these mixing 
coefficients we make the following remark. 

Remark 3 Theorem [7] also holds if we replace eondition & by Efc>i < oo. 

It should be noted that this strong mixing rate was already pointed out in Bosq et al. (1999), 
without assuming reversibility. The advantage here is that we can have the asymptotic normality 
of the kernel estimators by only requiring a condition on the joint density of the vector (Xq, X2) 
and not on all the joint densities. 

We finish this section by mentioning a few notations which will be used in this paper. The 
largest integer smaller or equal to x will be denoted by [x]. By Cn <C we understand that 
Cn < Cdn for some C > 0 and all n; we denote by a V 6 the maximum between a and b. 

2 Technical results 

The proof relies on the properties of reversible Markov chains. We shall point first some mono¬ 
tonicity conditions for integrals of functions of reversible Markov chains. The regular conditional 
probability of Xi given Xq will be denoted by Q{x, A) = P(Xi G A\ Xq = x). Let Q also denote 
the Markov operator acting via {Qf){x) = Jg f{s)Q{x,ds). Next, let p 2 ('^) be the set of mea¬ 
surable functions such that f g'^dn < 00 and f gdir = 0. In operator terms the Markov chain is 
called reversible if Q = Q*, where Q* is the adjoint operator of Q. 

For some function g G L2('7r), let 

Yi = g{Xi). 

Denote Xn = (j{- ■ ■ Xn-i,Xn)', = P(I^|7u;)- It is well known that, by the definition of 

Markov chains, for Y G a{Xi,i > 1) we have = P(7^|-^o) = Pxo(7")- From the spectral 
theory of self-adjoint operators on Hilbert spaces (see for instance Rudin, 1991), it is well known 
that for every g G IL 2 ('^) there is a unique transition spectral measure v supported on the 
spectrum of the operator [—1,1], such that 

E(Eo(yi)Eo(7^i)) = l'^s^+^n{ds). (6) 

By using this representation we give the following lemma which relates the conditional expec¬ 
tation with the covariances for functions of reversible Markov chains. It also points out several 
monotonicity conditions for the covariances. 

Lemma 4 For stationary reversible Markov chains and every positive integers k,j we have 


E(EoyfcEoy,-) = E{YoYk+j). 


For any integer k > 0 


E{Yo{Y2k + Y2k+i))>0, 


( 7 ) 

( 8 ) 
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also, for any integers j and k such that 0 < j < k 


E(yoy 2 ,) > E(yoy 2 fe) > o. 

(9) 

and for integers k >2 


E{YoYk) < E{YoY2). 

(10) 

For any positive integer i and any j > 21 


j 

Y ]E(yoyfc) > 0. 

(11) 


k= 2 e 


For any positive integer i and any n >2i 

j n 

^max^ Y, nYoYk) < Y ^(YoYk) + nYoY2e). (12) 

Proof. To prove relation ([7|) just note that KiYoYk+j) = E(lolEoTfc+j) and apply relation 
(jU]). Then, note that 

E{Yo{Y2k + Y2k+i)) = + s^'^+^Mds), 

and s^^(l + s) > 0 for all — 1 < s < 1; so relation ([ 5 ]) holds by Relation Q is clearly true 
since for 0 < j < fe, we have > 0 for all s. Finally in order to show relation (|10p just 

note that for all — 1 < s < 1 and any integer k >2 we have s^ < which we combine with ([U|). 

Relations m and (fT^ are obtained via a blocking argument. Now, if j is odd, say j = 
2m + 1, we can write 


j=2m+l m 

Y HYoYk) = Yi^(YoY2k) + E{YoY2k+i)) > 0, 

k=2e k=i 

since, by relation ([8]) in the right hand side we have a sum of positive terms. 
On the other hand, if j is even, say j = 2{m + 1) 


j=2m+2 2m+l 

Y nYoYk) = E(yo>^ 2 („^+l)) + HYoYk). 

k=2i k=2£ 

By relation ([9]), E(yoy 2 (m+i)) ^ 0- Therefore IfTTI) is true for all j > 2i. 

Now, if n > 2i, by combining the latter considerations with ([9]) and (jlOl) 


j 


max 

2£<j<n 


Y ^(YoYk) 

k= 2 £ 


< 


2j 


max 

2£<2j<n 


Y ^YoYk) V 
k=2e 


2m+l 


max 

2£<2m+l<n 


nYoYk) 

k= 2 i 


2m+l mn 

< max V E{YoYk) + max E(yoy 2 j) < nYoY 2 i) + V E(yoyfc). 

2l<2m+l<n ^ 2e<2j<n 

- k=2£ - k=2i 
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where is the largest odd integer smaller than n. If n is odd n = If n is even, n > 21', 
then rUn = n — 1. By taking into account relation Q we can add in this case a positive term, 
E(yok^); and obtain overall relation ()12ll . □ 


Next, we give a CLT for a triangular array of row-wise stationary reversible Markov chains. 
The conditions for the CLT are imposed to the covariances of both the variables and their 
squares. 

Theorem 5 Let (Xj)jgz he a stationary reversible Markov chain. For real functions fn, define 

X^^k = fn{Xk). (13) 

Assume that 

^^n,k < ^Xn,k = 0 and E(x2 o) <t^, (14) 

n 

COY{Xnfi, Xn,2) + ^ COv(X„^o, -^n,fc) “S' 0, (15) 

k=2 

and 

1 "■ 

-(var(x2 o) + Y^cov{Xl^Q,Xl^J) 0. (16) 

u=0 

Then 

1 

Proof. We start from a standard martingale decomposition by using projections: 

n n 

Sn — 'y ^ (^n,fc ^k—lXn,k) T ^ ^ l^n,fc 
k=l k=l 

n n 

— ^ L)n,k T 'y ''^ ^k—lXn,k‘ 

k=l k=l 

Note that Dn,fc = Xn^k — ^k-iXn,k are martingale differences adapted to (Tfc)^>;^. 

We show first that the second term divided by ^/n is negligible for the convergence in distri¬ 
bution. To show this we estimate var(^^^^ fc). By using the properties of conditional 

expectation, stationarity and relation ([7]) in Lemma [H for k < j we obtain 


^{^k-lXn,k^j-lXnj) — K{XnjEk-lXn,k) — ^{XnJ-k+l^oXn,l) 

= E(Eoyn,lEo^nj-fc-|-l) = ^{Xn,oXn,j-k+2) 
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and so. 


n n j —I 

E(^ Ek_iXn,k)‘^ = n ■ cov{Xn,o,Xn, 2 ) + 2 EE E(^n, 0 ^n j—fc+ 2 ) 

k=l j=2 k=l 

j 

< 2n max E(X„ o-^n,fc)- 
2<j<n ^^ 

“ k=2 

By applying now relation ()12ll of Lemma 0] 

^ n n 

—— 2E(^n,0-^n,2) + 2 E{XnfiXn^k) ^ 
k=l k=2 

which converges to 0 by (|15p . 

Now we analyze the martingale differences via Theorem [6] given in Appendix. To show that 
maxi<fc<„ \Dn^k\/V^ is uniformly integrable we show that E(maxi<fc<„ < Cn, for all n 

and some constant C > 0. Indeed, since E(L)^g) < E(A^q), by (fTT]) we note that there is a 
positive constant C such that E(D^ o) — i-i* therefore, by stationarity 


It remains to verify 


We note that 


n 

E( max Dl k) < ^E(d2 ) < Cn. 

l<k<n 

- k=l 


[nt] 


-Y^D. 

n ^ 


2 

n.k 


ta" 


k=l 


^n,k — ^n,k + ~ ‘^^n,k0^k-l^n,k) — ^n,k + ^n,k- 

Furthermore, by the Cauchy-Schwartz inequality, (I14p . Lemma [Hand stationarity 

1 n 1 ^ 

\In,k\ = — KEk-lXn^k)^^ - 2Xn,k{^k-lXn,k)\ 
k=l k=l 

2 ^ 

< E((E_iA„ o)^) H— Y] ||A'„ fc||2||Efc_iA„ fc||2 
k=l 

< COv{Xn,0,Xn,2) + C^JYvXXXX)■ 

We see that the last quantity converges to 0 by condition (jl5|) combined with relation m in 
Lemma m We also note that by stationarity and (|14l) . 

-E(E-’^L) = iEE(xJ„) ^ 
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So, it remains to show that 


[nt] 


-E(^n,fc-E(X2 o)) Ao, 

n ^ 


which will be implied by 


fc=i 


^ [nt] 

var(-^xA) ^ 0. 


k=l 


We estimate now this variance. Note that by relation (1121) in Lemma 01 

.. 

k=l k=2 u=l u=0 

2t 


\nt] [nt] k—1 rc 

^ ^var(XA) + max ^ cov(A,o> ^n,«)) 

i ^ ’ Ti^ < ^ ^ (\<rh<rrn < ^ ’ ’ 


< 


n 


(var(XA) + X] cov(xA, Xl^u))- 


w=0 


The result follows by condition (I16p . □ 


3 Proof of Theorem [T] 

By the consistency of fn{xj) due to the continuity of f{x) at Xj and the assumptions we made 
on the bandwidth and kernel we only need to show that 


vxJ, 


By the Cramer-Wold device, it suffices to prove that 

/TT-^ fniXj) - ^fniXj] 

Vnbn^^Aj^f^x,)fKHu)duy/^ - 

for arbitrary fixed Ai, • • • , Am. Here Zi,l < j < m are i.i.d. standard normal random variables. 
Let Sn = J2i=i Xn,i where 


A 


1 


- ,/TrYl 


A. 


VK ^ ifixj) f K^(u)duy/^ bn 
We shall verify the conditions of Theorem [5l We verify hrst (I14p . 


(if(_ EiL(A^)). 


var 




bn At f K‘^{u)du 


var(A(^^-—-)) 


^ m—1 m 

4 e e 




f xyu)du 


cov(iL(^Ar^), = ^n + Iln- (17) 















Now, by Bochner’s lemma (see Parzen, 1962, or Bosq, 1998) and the fact bn 0, 


lim = f{xj) [ K‘^{u)du. 

n^co y/bn bn J 


Therefore 


m 

In ^ 

1=1 


On the other hand by simple calculus computations involving the symmetry of for j ^ p we 
have 


1 ( (^3 ^ 0 ' 

1 


,k C\ ^° )) 


Ki^^)K{^^)f{u)du-y I K{^^)f{u)du I K{^^)fiu)du 

= J K{v)K{v + ^ )f{xj - bnv)dv - bn j K{v)f{xj - bnv)dv J K{v)f{Xp-bnV)dv. 

Clearly the second term is convergent to 0 by Bochner’s lemma and the fact that bn —)• 0. For the 
hrst term we cannot apply directly the Bochner lemma, but by using the same arguments as in 
its proof presented in (Parzen, 1962) along with the Lebesgue dominated convergence theorem, 
under our conditions we deduce that this term is also negligible. Hence Var(Yn^i) —)• 

To verify condition (I16p we introduce the function 


Ku) = 


A,- 


^ {f{xj)fK^{u)duy/^ 


_ EiF(^i-^)) 


Since by our conditions on K, the function g{u) has bounded derivative, by Newman extension 
of Hoeffding lemma (see relation (22) in Newman, 1980), 

coviYnfi,Yn^k) = ^ j j g'{u)g'{v)Hk{u,v)dudv. 

Therefore, with C = iYlJLi f K‘^{u)duY/‘^Y we obtain 


k=l 


,Vk, 


k=l 


which converges to 0 by taking into account our conditions on bn and gt- 

We have also to treat var(y^g)/n. We shall apply first Holder inequality to obtain 
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Note that for any p > 1 


^ = ^n j KP{u)f{x + bnu)du. 

So by the Bochner’s lemma and the fact that the kernel is bounded 
1 


n 


var(y^Q) < max f K*{u)f{xj + bnu)du H—6^( f K{u)f{xj + bnu)du)^ 0 

’ i<j<m nbn J n J 


provided nb^ —>■ oo. Therefore (fT6]l is satisfied. 
We turn now to verify condition (jl5jl . Since 


COv(Ynfi,Yn,k') — , ^ ^ 


XjXp 


,Xi - Xn 


. Xt) Xh 


bn {f{xj)f{xp))^/‘^jK‘^iu)du 
it is enough to show that for any j and p fixed 




. X i Xr) 


,Xp - X2. 


-cov{K{^—- - ),K{ ^ > 0 


(18) 


and 


^ cov{K{ 


Xj - Xo^ 




(19) 


k=2 


We shall estimate cov{Xn,o, X^^k) in two different ways and take the minimum of these estimates. 
By Lemma m for k > 2 


A = cov(iL( 


Xi-Xo ^ ^^,Xp-Xk^^ ^ r^,Xj - Xq. 


),KC- 


-)) < cov(iL(- 


,Ki 


Xp - X2 


)) 


• _ 7/ -T* _ 7J 

K{^ -- ){f2{u,v) - f{u)f{v))dudv. 

On On, 


By changing the variable 


A — bn 


K{u)K{v){f2{xj + ubn,Xp + vbn) - f{xj + ubn)f{xp + vbn))dudv (20) 


< K 


K{u)K{v){f2{Xj + ubn, Xp + vbn)dudv. 


To analyze this term we divide the integral in (j20p on 4 sets 

(|u — Xj\ < M) X (|u — Xp\ < M) 

{\u — Xj\ > M) X (|u — Xp\ < M) 

(|rt — Xj\ < M) X (|u — Xp\ > M) 

(|n — Xj\ > M) X (|u — Xp\ > M). 
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On the first set, (|ti — Xj\ < M) x (|?; — Xp\ < M), we change the variable and obtain 

M M 

I I bxi 

bn / K{u)K{v){f 2 {xj - bnU,Xp - bnv)dudv < 

J-M J_M 




< bn sup \f 2 {xj + a,Xp + a)\ f / K{u)K{v)dudv. 

\a\<M 


By our assumptions this term is smaller than bnCM- On the set {\u — Xj\ > M) x {\v — Xp\ < M) 
we have 

- Xj\ > M)I{\X2 - xp\ <M)< WKW^^kA. 

A similar estimate is obtained on the set (|u — Xj\ < M) x (|u — Xp\ > M). On {\u — Xj\ > 
M) X {\v — Xp\ > M) we estimate in the following way 


' E(/f -x,\> M)I{\X2 - 1,1 > M) < 

So, for n sufficiently large 

^cov(A:( ^^ < bnC{xj,Xp) (21) 

where 

C{xj, Xp) = Cm + 2|l-^lloo'^A'(— ) + 

K bn bi bn 

By our conditions on K for n sufficiently large C{xj,Xp) is bounded. 

So clearly by (l2T]) condition (fTS]) is satisfied. 

On the other hand. 


1 

¥ 




0 ^ 


)) 


K‘ 


,,Xn — U, ^^,,Xn — V. , , , , ,, ^,,,,9 1 

^ 7 -(^T- )Hk{u,v)dudv <\\K 

^ri 


So by combining the estimates in (I2ip and (|22l) we have proven that 


( 22 ) 


bn bn bn 

< mm{bnC{xj,Xp),c-jCov{Xo,Xk)) <C min(fi 


(23) 


To continue we use the estimate from (j23[) to bound the sum in the right hand side of (I19p . We 
shall divide the sum in two, up to rUn and after uin- This sequence of positive integers will 
be selected later. On the first part of the sum we use the bound of order bn and on the second 
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part of the sum we use the bound r]k/bf^. So, by the properties of slowly varying function I, 


— ^coy{K{ 


Xj - Xo 


k=2 


Y" It' 

K E ii’ft 

k=2 fc=m„+l ^ 

, 1 
< rrinOn + 


hlmll {nin)' 


To optimize the sum we take 


vrin = [max( 


vs;’ 6J1/6 (i/6V2) 


)] + !■ 


Clearly runbn —t 0 (since bnl^/h^ —t 0 and l/l^/^{l/b]l‘^) —> 0). 

Since > bn ' and I is increasing we have l{mn) > l{bn ' ) and so, since rUn > 

{bnl^/^{l/bl!^))-\ 


< 




- 1 / 2 , 


l{bn^^^) 


Kl/bl/^) 


0 as n —)> oo. 


By taking now into account Lemma [H it follows that the sum in (1191) is positive and then 
m follows. Now by (jlSp cnid (jl9p wo coiicIucIg tli9,t condition m is satisfied and the result 
follows. □ 

Proof of Remark [S]. To prove this remark we have to replace in the proof of Theorem [1] 
relation (j22|) by relation (3.12) in Bosq et al. (1999), namely 

— |cov(iL(^-- ),K{^ -)| < —\\K\\^ak. 

Then, we replace relation (1231) by 

—cov{K{-^ -),iL(-(---) < mm(6„, j-ak), 

Or), O77, 


and follow the proof from the page 88 in Bosq et al. (1999), to obtain the result of this remark. 


4 Appendix 

Martingale limit theorem (Ganssler and Hausler, 1986, pages 315-317). 

Theorem 6 Assume {Dn,k)i<k<n is a triangular array of martingales adapted to an increasing 
in k filtration Fn,k- Assume Y^k=i k ^ 

max \Dn,k\ is uniformly integrable (24) 

l<k<n ’ 

(as before, by [x] we denote as usual the integer part ofx). Then ^ aW{t) where W{t) is 
a standard Brownian measure. In particular Sn X(0, cr^). 
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